Lag0s

Week Summary

Artificial Intelligence

DALDA enhances data augmentation techniques by leveraging both LLMs and diffusion models to generate semantically rich images.

AlphaChip represents a significant advancement in AI applications for chip design, utilizing reinforcement learning methodologies.

The Statewide Visual Geolocalization project provides resources for implementing visual geolocalization techniques in real-world scenarios.

CaBRNet introduces a framework for developing explainable AI models, addressing reproducibility and fair comparisons.

The BitQ paper proposes a framework for optimizing block floating point precision in deep neural networks for resource-constrained devices.

Commit-0 is an AI coding challenge aimed at rebuilding core Python libraries, emphasizing code quality and testing.

OpenAI

NotebookLM

The impact of AI on labor markets will be gradual, allowing society to adapt while fostering a culture of collaboration and innovation.

AI has the potential to address global challenges like climate change and space colonization, but risks must be managed proactively.

The need for accessible computing infrastructure is crucial to ensure AI benefits everyone and does not lead to inequality.

AI's role as an autonomous assistant in healthcare and technology development is expected to evolve, marking a transition to the Intelligence Age.

Deep learning breakthroughs have positioned AI to resolve complex problems, leading to significant improvements in quality of life.

The integration of AI into daily life promises unprecedented levels of shared prosperity, although wealth alone does not guarantee happiness.

OpenAI

OpenAI Japan's CEO announces GPT Next with 100x the compute load of GPT-4, set for release this year.
Wednesday, September 4, 2024
OpenAI's GPT Next will be trained with approximately 100x the compute load of GPT-4 and will be released later this year. Estimates suggest that some of the change in compute load is also due to algorithmic improvements. The article is in Japanese.
Hi Impact
OpenAI Japan AI
OpenAI launches fine-tuning for GPT-4o with 1 million free training tokens per day until September 23.
Wednesday, August 21, 2024
OpenAI has launched fine-tuning for GPT-4o, allowing developers to customize the model for specific use cases with their own datasets. It is offering 1 million free training tokens per day through September 23.
Hi Impact
OpenAI GPT-4o AI
Redis transitions to dual source-available licensing starting with version 7.4 to sustainably offer its source code.
Thursday, March 21, 2024
All future versions of Redis will be released with source-available licenses starting with Redis 7.4. Redis will no longer be distributed under the three-clause Berkeley Software Distribution and will be instead dual-licensed under the Redis Source Available License and Server Side Public License. The new licenses enable Redis to sustainably provide permissive use of its source code, allowing it to continue to be freely available to developers, customers, and partners through Redis Community Edition as Redis moves on to its next phase of development as a real-time data platform with a unified set of clients, tools, and core product offerings.
Hi Impact
Redis Software Licensing
Neuralink's first human patient, Nolan Arbaugh, successfully controls a computer with his thoughts.
Thursday, March 21, 2024
Neuralink has shared a video of its brain-computer interface in action inside a human patient. The patient, a 29-year-old man named Nolan Arbaugh who is paralyzed from the neck down, is able to control a cursor on a screen using Neuralink's device. He is able to play games online for about eight hours before the implant needs to recharge. Arbaugh reports that his experience with the implant has so far been positive, despite some initial issues. The video is available in the article.
Hi Impact
Neuralink Nolan Arbaugh Brain-Computer Interface
University of Amsterdam scientists use CRISPR to eliminate HIV from cells, still a proof-of-concept.
Thursday, March 21, 2024
Scientists from the University of Amsterdam claim they have successfully eliminated HIV from infected cells using CRISPR gene-editing technology. The work remains a proof-of-concept and will not become a cure for the disease any time soon. The study's findings are still being scrutinized.
Hi Impact
CRISPR Netherlands Health
OpenAI's GPT-5, expected in mid-2024, aims to be a significant upgrade over GPT-4.
Thursday, March 21, 2024
OpenAI is expected to release a major AI model, possibly GPT-5, sometime in mid-2024, likely during the summer. The new model will likely be a multimodal large language model with capabilities similar to GPT-4's, but better. OpenAI is reportedly still training the model, after which it will go through internal safety testing to identify any issues before public release. Issues other than those found in testing may also delay the launch.
Hi Impact
OpenAI GPT-5 AI
OpenAI's GPT-5 delayed to late 2025, promising Ph.D.-level intelligence.
Monday, June 24, 2024
GPT-5 will be significantly more intelligent than GPT-4. It is still at least a year and a half from release. The model will have advanced memory and reasoning capabilities and Ph.D.-level intelligence in specific tasks. A short video of OpenAI's Chief Technology Officer Mira Murati comparing the different GPTs is available in the article.
Hi Impact
OpenAI GPT-5 Mira Murati Artificial Intelligence
OpenAI introduces GPT-4o, a superior multimodal model with new features.
Tuesday, May 14, 2024
OpenAI has announced a new model called GPT-4o (o is for omni) that is a natively multimodal model, with superior performance to GPT-4 on text and state-of-the-art performance on a variety of modalities. It also announced a new desktop app, a near real-time audio interface, and a variety of improved reasoning features.
Hi Impact
OpenAI GPT-4o
OpenAI releases GPT-4o, an AI model with enhanced capabilities and efficiency.
Tuesday, May 14, 2024
OpenAI has released GPT-4o, an AI model that can reason across audio, vision, and text in real time. Developers can access the GPT-4o API as a text and vision model. It's 2x faster, half the price, and has 5x higher rate limits compared to GPT-4 Turbo.
Hi Impact
GPT-4o
OpenAI
New AI models challenge GPT-4's dominance amid transparency concerns.
Tuesday, March 26, 2024
GPT-4's dominance in AI benchmarks has been challenged by four new models from different vendors, each showing the potential to surpass GPT-4's capabilities. However, amid growing legal and ethical scrutiny, none of these models is open-source or transparent about its training data. The push for models trained on public domain or licensed content continues, highlighting how difficult it is to build competitive AI without proprietary data.
Hi Impact
GPT-4
OpenAI's GPT-4o introduces real-time audio-video conversations with emotional AI capabilities.
Tuesday, May 14, 2024
OpenAI announced a new AI model yesterday called GPT-4o that can converse using speech in real time, read emotional cues, and respond to visual input. It will roll out over the next few weeks for free to ChatGPT users and as a service through API. Paid subscribers will have five times the rate limits of free users. The API will feature twice the speed, 50% lower cost, and five times higher rate limits compared to GPT-4 Turbo. A 26-minute long video that introduces GPT-4o and demonstrates its abilities is available in the article.
Hi Impact
OpenAI GPT-4o AI
OpenAI unveils o1, its pioneering model with reasoning capabilities, marking a significant advancement in AI.
Friday, September 13, 2024
OpenAI has released o1 and o1-mini, the first in a series of reasoning models that have been trained to answer more complex questions faster than a human can. The model is better at writing code and solving multistep problems than previous models, but it is more expensive for developers and slower to use than GPT-4o. The release is still in preview to indicate how nascent it is. ChatGPT Plus and Team users should already have access to the model, while Enterprise and Edu users will get access early next week. OpenAI plans to bring o1-mini access to all free users, but it hasn't set a release date yet.
Hi Impact
OpenAI o1 Artificial Intelligence
OpenAI launches GPT-4o mini, a versatile AI model for free users.
Friday, July 19, 2024
OpenAI has launched a new AI model called GPT-4o mini. It is available to free users. The 'o' in the name stands for 'omni', hinting at the model's audio, video, and text capabilities. The model will be made available to ChatGPT Enterprise users next week.
Hi Impact
OpenAI GPT-4o mini AI
OpenAI may have leaked GPT-4.5 Turbo, enhancing speed, accuracy, and scalability.
Monday, March 18, 2024
OpenAI appears to have accidentally published a blog post that was indexed by Bing and DuckDuckGo before it was quickly taken down. The post was an announcement for GPT-4.5 Turbo, a new model that surpasses GPT-4 Turbo in speed, accuracy, and scalability. The cached description mentions a knowledge cutoff date of June 2024 and a 256k context window. OpenAI has not commented on the leak.
Hi Impact
OpenAI GPT-4.5 Turbo
OpenAI's GPT-4o and GPT-4o mini models can now be customized by developers for business use, allowing for enhanced personalization and control.
Wednesday, August 21, 2024
OpenAI's GPT-4o and GPT-4o mini can now be fine-tuned and customized by developers for business use. Developers can use their own datasets to enhance the model's knowledge base with proprietary information and control how the model responds to specific questions. Fine-tuning costs $25 per 1 million tokens for GPT-4o and $3 per 1 million tokens for GPT-4o mini. 1 million tokens is roughly equivalent to 2,500 pages in a standard-size book.
Hi Impact
OpenAI GPT-4o Technology
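The fine-tuning prices quoted in the item above lend themselves to a quick back-of-the-envelope estimate. The sketch below is illustrative only: the helper names and the model keys are hypothetical, the prices and the pages-per-token ratio are taken from the summary, and it assumes billing scales linearly with tokens processed (tokens times epochs), which the summary does not confirm.

```python
# Prices quoted in the summary above (USD per 1 million training tokens).
# The dictionary keys are just labels for this sketch.
PRICE_PER_MILLION_TOKENS = {"gpt-4o": 25.0, "gpt-4o-mini": 3.0}

# The summary's rule of thumb: 1 million tokens is roughly 2,500 book pages.
PAGES_PER_MILLION_TOKENS = 2_500


def tokens_for_pages(pages: int) -> int:
    """Convert a page count to an approximate token count (hypothetical helper)."""
    return pages * 1_000_000 // PAGES_PER_MILLION_TOKENS


def fine_tuning_cost(model: str, training_tokens: int, epochs: int = 1) -> float:
    """Estimate fine-tuning cost in USD (hypothetical helper).

    Assumption: cost is linear in tokens processed, i.e. tokens * epochs.
    """
    price = PRICE_PER_MILLION_TOKENS[model]
    return price * training_tokens * epochs / 1_000_000


if __name__ == "__main__":
    tokens = tokens_for_pages(5_000)  # a 5,000-page corpus
    for model in PRICE_PER_MILLION_TOKENS:
        print(model, tokens, fine_tuning_cost(model, tokens))
```

Under these assumptions, a 5,000-page corpus comes to about 2 million tokens, so a single pass is a matter of tens of dollars on GPT-4o and single digits on GPT-4o mini.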
OpenAI's training and inference costs for ChatGPT could reach $7 billion in 2024, with a potential $5 billion loss.
Monday, July 29, 2024
OpenAI anticipated spending around $4 billion on Microsoft's Azure servers for ChatGPT inference in 2023, potentially causing significant financial losses. Although ChatGPT brings in approximately $2 billion annually, OpenAI may require additional funding within a year to address a projected $5 billion shortfall. It currently uses the equivalent of 350,000 servers with Nvidia A100 chips, mainly for ChatGPT, at discounted rates from Azure.
Hi Impact
OpenAI ChatGPT Financial
xAI's Grok 2 AI model to launch in August, trained with 20k H100s.
Thursday, July 4, 2024
xAI's next model is slated for August. It reportedly took 20k H100s to train. Grok 3 is rumored for the end of the year and potentially requires 100k H100s.
Hi Impact
xAI Grok 2 AI
OpenAI and Google's new AI models promise real-time multimodal understanding and improved AI assistants.
Thursday, June 20, 2024
OpenAI and Google have introduced advanced AI models that enable real-time multimodal understanding and responses, promising improved AI assistants and innovations in voice agents. OpenAI's GPT-4o boasts double the speed and half the cost of its predecessor, while Google's Gemini 1.5 Flash delivers a significant reduction in latency and cost. Both tech giants are integrating AI across their ecosystems, with OpenAI using its products and partnerships to pursue consumer markets that could reach up to a billion users.
Hi Impact
Google Gemini 1.5 Flash AI
OpenAI and Meta tease new AI models GPT-5 and Llama 3 with enhanced reasoning capabilities, amid skepticism.
Tuesday, April 16, 2024
OpenAI and Meta are teasing the next iterations of their AI models, expected to feature enhanced reasoning and planning capabilities. Dubbed GPT-5 and Llama 3, the models aim to advance toward artificial general intelligence, with vague release timelines and application details. The tech community remains skeptical given the history of overhyped AI promises with limited substantive evidence.
Hi Impact
OpenAI GPT-5
Meta Llama 3
OpenAI to present ChatGPT and GPT-4 updates on May 13, possibly launching an AI search engine.
Monday, May 13, 2024
OpenAI has announced a live stream event scheduled for May 13 to present updates related to ChatGPT and GPT-4, possibly including the launch of an AI-powered search engine.
Hi Impact
OpenAI ChatGPT Event
Comparing AI models GPT-4, Claude 3 Opus, and Gemini Advanced for coding, writing, and task streamlining.
Tuesday, March 19, 2024
There are three prominent models in the AI landscape: GPT-4, Claude 3 Opus, and Gemini Advanced. Each has distinct characteristics and capabilities, catering to different needs such as coding, writing, or streamlining basic tasks. This article delves into concepts like context windows and agents, highlighting how these features enhance AI systems' capabilities. Even though GPT-4 remains the benchmark, emerging features in other models could make them solid alternatives in the near future, depending on the use case.
Hi Impact
GPT-4
Claude 3 Opus
Gemini Advanced
AI Models
OpenAI introduces GPT-4o Mini, a smaller and cheaper model than GPT-3.5, with a score of 82 on MMLU.
Friday, July 19, 2024
GPT-4o Mini is a new, smaller model from OpenAI intended to replace GPT-3.5. It scores 82 on MMLU, which is reasonable for a cheap model.
Hi Impact
OpenAI
GPT-4o Mini
OpenAI's CriticGPT, based on GPT-4, helps catch mistakes in the RLHF process.
Friday, June 28, 2024
OpenAI has trained a model based on GPT-4 that can catch GPT-4 mistakes and help humans spot mistakes during the RLHF process.
Hi Impact
OpenAI CriticGPT AI
GPT-4o by OpenAI enhances human-machine communication, signaling a new era of AI interaction.
Monday, May 27, 2024
GPT-4o, OpenAI's latest AI model, bridges real-time communication between humans and machines, extending capabilities beyond text to include vision and audio. The AI revolution introduces a new wave of human-to-AI and eventual AI-to-AI interactions, likely impacting the dynamics of our social behaviors and business models. As this technology progresses, its effect on human communication will unfold, potentially catalyzing the creation of innovative companies and software solutions.
Hi Impact
GPT-4o
OpenAI
OpenAI's Evan Morikawa delves into the mechanics behind ChatGPT's functionality.
Wednesday, April 24, 2024
In this article, OpenAI's Evan Morikawa provides insights into ChatGPT's inner workings, from input text processing and tokenization to prediction sampling with large language models. ChatGPT operates by turning tokens into numerical vectors (embeddings), multiplying them by weight matrices containing billions of values, and selecting the most probable next word. The technology is grounded in extensive pretraining to predict text based on vast amounts of internet data.
Hi Impact
OpenAI ChatGPT AI Technology
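The pipeline described in the item above (tokens to embeddings, a multiplication by weights, then picking the most probable next word) can be sketched with a toy example. Everything here is made up for illustration: the four-word vocabulary, the 2-dimensional embeddings, and the single weight matrix are hypothetical, and a real model stacks many layers and usually samples from the probabilities rather than always taking the top one.

```python
import math

# Toy vocabulary and made-up numbers; a real model has billions of weights.
VOCAB = ["the", "cat", "sat", "mat"]

# One 2-dimensional embedding vector per token (hypothetical values).
EMBEDDINGS = {
    "the": [1.0, 0.0],
    "cat": [0.0, 1.0],
    "sat": [1.0, 1.0],
    "mat": [0.5, -0.5],
}

# Weight matrix mapping a 2-d vector to one score (logit) per vocabulary entry.
WEIGHTS = [
    [0.1, 2.0, 0.3, 0.2],  # row for embedding dimension 0
    [0.2, 0.1, 3.0, 0.4],  # row for embedding dimension 1
]


def softmax(scores):
    """Turn raw scores into probabilities that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def predict_next(token):
    """Return the most probable next token and the full distribution."""
    vector = EMBEDDINGS[token]
    # Vector-matrix product: one logit per vocabulary entry.
    logits = [
        sum(vector[i] * WEIGHTS[i][j] for i in range(len(vector)))
        for j in range(len(VOCAB))
    ]
    probs = softmax(logits)
    best = max(range(len(VOCAB)), key=lambda j: probs[j])
    return VOCAB[best], probs


if __name__ == "__main__":
    word, probs = predict_next("the")
    print(word, [round(p, 3) for p in probs])
```

Taking the arg-max mirrors the summary's "most probable next word"; swapping that line for a weighted random draw over `probs` would give the sampling behavior the article also mentions.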
Google launches Cloud TPU v5p and Axion CPU to advance in the AI hardware race.
Monday, April 15, 2024
Google's new AI chip, Cloud TPU v5p, is now available. It boasts nearly triple the training speed for large language models compared to its predecessor, TPU v4. This release underscores Google's position in the AI hardware race alongside competitors like Nvidia. Google has also introduced the Google Axion CPU, based on Arm's chip infrastructure, promising better performance and energy efficiency.
Hi Impact
Google Cloud TPU v5p
Google Axion CPU
Nvidia
Microsoft
Amazon
AI Hardware
GPT-4's ability to exploit security vulnerabilities from CVE advisories highlights its advanced capabilities.
Monday, April 22, 2024
Researchers have demonstrated that OpenAI's GPT-4 model can autonomously exploit security vulnerabilities detailed in CVE advisories with an 87% success rate, far outperforming other models and tools like vulnerability scanners.
Hi Impact
OpenAI GPT-4 Security
ChatGPT's performance in code generation varies widely based on task and language.
Tuesday, July 9, 2024
OpenAI's ChatGPT has varied performance in code generation, with success rates ranging from less than 1% to 89% depending on factors like task difficulty and programming language.
Hi Impact
OpenAI ChatGPT AI
Elon Musk's xAI is developing a supercomputer for its next-gen Grok AI chatbot.
Monday, May 27, 2024
xAI is planning to build a supercomputer to power the next version of its Grok AI chatbot. Elon Musk says he wants to get the proposed supercomputer running by the fall of 2025. xAI could be partnering with Oracle for the project. When completed, the supercomputer will be at least four times the size of the biggest GPU clusters that exist today. Musk estimates that running the Grok 3 model and beyond will require 100,000 Nvidia H100 chips.
Hi Impact
xAI Grok AI chatbot Elon Musk Technology
OpenAI focuses on enhancing DALL-E 3 over GPT-4o, highlighting AI visual tech evolution.
Tuesday, June 25, 2024
Despite GPT-4o's advanced imaging capabilities, OpenAI is still actively enhancing DALL-E 3, focusing on refining text rendering and visual accuracy. Facing stiff competition from Midjourney and Ideogram, OpenAI's strategy underscores the continuous evolution and challenges in AI-driven visual technologies.
Hi Impact
OpenAI DALL-E 3

Month Summary

Artificial Intelligence

Intel unveiled its Core Ultra 200V lineup, promising superior AI performance and efficiency for thin laptops.

Alibaba Cloud launched Qwen2-VL, a vision-language model with enhanced capabilities for visual understanding and multilingual processing.

Google Photos introduced an AI-powered search feature, allowing users to search photos using complex natural language queries.

OpenAI is considering high subscription prices for its upcoming large language models, indicating a shift in its pricing strategy.

Google is providing AI-written summaries for news articles in search results, impacting publisher visibility and SEO strategies.

You.com

A new technique for overcoming overfitting in Vision Mamba models was introduced, allowing for scaling up to 300M parameters.

A report warns that generative AI models may struggle due to restrictions on crawler bots, leading to reliance on lower-quality data.

Anthropic released starter projects for scalable customer service agents powered by Claude, collaborating with former AI heads from major companies.

OpenAI's upcoming GPT Next will be trained with 100 times the compute load of GPT-4, with a release expected later this year.

Nvidia's new Blackwell chip achieved top performance in MLPerf's LLM Q&A benchmark, while competitors like AMD and Untether AI also showed strong results.

xAI has launched the world's largest training cluster, Colossus, built on 100,000 H100 GPUs, with plans to double its size soon.

Nearly 200 Google DeepMind employees urged the company to end military contracts, citing ethical concerns regarding AI use.

Apple is exploring robotics, potentially introducing devices like an iPad on a robotic arm, with a projected release in 2026 or 2027.

Cohere's Command R and Command R+ models received upgrades, improving recall, speed, math, and reasoning capabilities.